Picture for Yong Li

Yong Li

Tsinghua University

SpatialAct: Probing Spatial Reasoning-to-Action Capabilities of VLM Agents in 3D Scenes

Add code
May 29, 2026
Viaarxiv icon

REVERSE: Reinforcing Evidence Verification and Search for Agentic Image geo-localization

Add code
May 26, 2026
Viaarxiv icon

UniVLR: Unifying Text and Vision in Visual Latent Reasoning for Multimodal LLMs

Add code
May 12, 2026
Viaarxiv icon

NavOne: One-Step Global Planning for Vision-Language Navigation on Top-Down Maps

Add code
May 07, 2026
Viaarxiv icon

Resource-Constrained Robotic Planning in the face of Mixed Uncertainty

Add code
May 07, 2026
Viaarxiv icon

LoViF 2026 The First Challenge on Holistic Quality Assessment for 4D World Model (PhyScore)

Add code
May 06, 2026
Viaarxiv icon

iWorld-Bench: A Benchmark for Interactive World Models with a Unified Action Generation Framework

Add code
May 06, 2026
Viaarxiv icon

A Benchmark for Interactive World Models with a Unified Action Generation Framework

Add code
May 05, 2026
Viaarxiv icon

LLMs Reading the Rhythms of Daily Life: Aligned Understanding for Behavior Prediction and Generation

Add code
Apr 26, 2026
Viaarxiv icon

DLink: Distilling Layer-wise and Dominant Knowledge from EEG Foundation Models

Add code
Apr 16, 2026
Viaarxiv icon